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ABSTRACT 

The primary challenge for experiments measuring the neutral hydrogen power spectrum from the 
Epoch of Reionization (EoR) are mode- mixing effects where foregrounds from very bright astrophysical 
sources interact with the instrument to contaminate the EoR signal. In this paper we identify a new 
type of mode-mixing that occurs when measurements from non-identical baselines are combined for 
increased power spectrum sensitivity. This multi-baseline effect dominates the mode-mixing power in 
our simulations and can contaminate the EoR window, an area in Fourier space previously identified 
to be relatively free of foreground power. 



1. INTRODUCTION 

Observing redshifted 21 cm neutral hydrogen emission 
from the Epoch of Reionization (EoR) is an exciting 
new tool for observational cosmology. These observa- 
tions have the potential to reveal the timing and du- 
ration of reionization, to constrain models of star and 
galaxy formation and to determine what kinds of sources 
dominate reionization (for reviews see Morales & Wyithe 
2010; Furlanetto et al. 2006). Several new arrays are un- 
der construction or have recently been built to observe 
the 21 cm power spectrum, including LOFAR (LOw Fre- 
quency ARray 2 ), PAPER (Precision Array for Probing 
the Epoch of Reionization 3 ) and the MWA (Murchison 
Widefield Array 4 ). 

The major challenge for EoR observations is the pres- 
ence of astrophysical foregrounds that are 4-5 orders of 
magnitude brighter than the EoR signal. Early work 
recognized that the EoR signal could in principal be sep- 
arated from the astrophysical foregrounds because the 
foregrounds have smooth spectra and so should be con- 
centrated in the first few line-of-sight Fourier modes (fey) 
while the EoR signal would extend up to much higher k\\ 
modes (Morales & Hewitt 2004; Zaldarriaga et al. 2004; 
Jelic et al. 2008; Wang et al. 2006; Harker et al. 2009). 
Unfortunately this picture is somewhat complicated by 
the interaction of the foregrounds with realistic instru- 
ments, which have frequency dependent responses. This 
interaction is called mode-mixing because the chromatic 
instrumental response can add spectral structure to the 
astrophysical foregrounds, throwing foreground power to 
higher k\\ modes and obscuring the EoR signal. A num- 
ber of recent papers have begun to shed light on these 
mode-mixing effects through detailed examinations of 
how foregrounds propagate through the instrument and 
analysis. 

Precision mode- mixing simulations by Datta et al. 
(2010) first identified a distinctive wedge shape in k 
space, with the mode- mixing predominantly below a 
fey oc k± line. Investigations of the response of single 
baselines to flat-spectrum foregrounds by several groups 
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(Morales et al. 2012; Trott et al. 2012; Vedantham et al. 
2012; Parsons et al. 2012) showed that the wedge shape is 
characteristic of smooth spectrum astrophysical sources 
interacting with the chromatic response of the baselines. 
This mode-mixing occurs because the baseline integrates 
over a region of the uv plane to form visibilities and 
the baseline length in wavelengths varies with frequency. 
This results in a slowly varying value for the visibility as 
the baseline moves through the uv space and this vari- 
ation bleeds into the line-of-sight direction because of 
the physical size of the integration region (see Morales 
et al. 2012 for a more complete mathematical and pic- 
torial explanation). The wedge shape comes about be- 
cause longer baselines change length more quickly than 
shorter baselines, so the power at larger k± is thrown 
into proportionally higher fey modes. These authors also 
identified a region called the "EoR window" at low k± 
and high fey that should be relatively free of this kind of 
contamination. 

While these single baseline effects are very important, 
the mode- mixed power in the Datta et al. (2010) simu- 
lations is dominated by a multi-baseline effect that we 
identify in this paper. Multi-baseline mode-mixing oc- 
curs when measurements from non-identical baselines are 
combined together to increase the power spectrum sen- 
sitivity, as they are for nearly all proposed observations. 
This new effect has a shape in k space that is similar 
to the previously identified single baseline mode-mixing, 
with most of the mode-mixed foreground power thrown 
into the wedge, but it also contaminates the EoR win- 
dow. The multi-baseline effect is present in all analyses 
that combine measurements from non-identical baselines 
or use images to make power spectra. 

In the next section we introduce the multi-baseline 
mode-mixing mechanism using a simplified simulation 
with only one foreground source and show that it is fun- 
damental to multi-baseline analyses. Then in section 3 
we develop a precision simulation of the power spectrum 
with realistic foregrounds and we conclude in section 4. 

2. MULTI-BASELINE MODE-MIXING 

Visibilities from identical baselines (baselines with the 
same length and orientation) can be added coherently be- 
cause they measure exactly the same angular modes on 
the sky. Coherent summing averages down the noise on 
the measurement, increasing the instrumental sensitivity 
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to the associated modes. Of course baselines that have 
very different lengths and orientations measure different 
angular sky modes and cannot be added coherently, but 
baselines that are similar but not identical are partially 
coherent, that is they measure some of the same modes 
but they are each sensitive to modes not detected by the 
other. In the uv plane, baselines integrate over a small 
region of the plane given by their power response, so 
the measured visibilities contain signals from a range of 
uv locations. For identical baselines the integration re- 
gion is the same, but for partially coherent baselines the 
integration regions only partially overlap. Partially co- 
herent visibilities contain separate measurements of the 
modes inside the overlap region, so it is common to co- 
herently add the visibilities in the overlapping region to 
decrease the noise in those areas. All EoR sensitivity cal- 
culations explicitly combine partially coherent baselines 
(Morales & Hewitt 2004; Morales 2005; Bowman et al. 
2006; McQuinn et al. 2006; Lidz et al. 2008; Beardsley 
et al. 2012b), as the alternative is a dramatic decrease in 
the potential sensitivity. 

Unfortunately, combining measurements from non- 
identical baselines also introduces an additional type of 
mode-mixing. The reconstructed signal at a particular 
uv point contains co-added signal from an area of the uv 
plane given by the integration areas of all the co-added 
baselines. Because the locations of the baselines in the 
uv plane vary with frequency, the uv integration pattern 
for the reconstructed signal at the uv point also varies 
with frequency, introducing spectral ripple not present 
in the foregrounds themselves. 

A visibility measurement is described by equation (12) 
of Morales & Matejek (2009) 

m([v,/]) = B([v,/],[u,/])F([u,/],[^/])/([^/]) 

+ n([v,/]) (1) 

which we have adapted to explicitly note the frequency 
dimension. This equation describes the integral that is 
being done by the instrument in the uv plane, the inte- 
gral of the baseline power response (B) multiplied by the 
Fourier transform of the true sky brightness (FJ), and 
is mathematically identical to equation (1-13) in Clark 
(1999) up to the thermal noise term (n). B is only non- 
zero over a relatively compact region of the uv plane, so 
it defines the integration region for the baseline. It is 
important to note that neither the signal nor B is flat 
over the integration region. The variation in B over that 
region means that some areas of the uv plane contribute 
more strongly to the visibility than others and the inte- 
gration over the signal variation leads to information loss 
on small uv scales (this is the source of the single-baseline 
mode- mixing described earlier). 

For each uv location of interest, visibilities from all the 
overlapping baselines need to be added using a weighted 
sum. The reconstructed power at the uv location (u^) is 
given by 

7([u J) /])=B([u i ,/],[v ) /]) T B([v,/],[u ) /]) 

xF([u, /],[#, /])/([#,/]) (2) 

where B is the baseline weighting function (and we have 
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dropped the noise term for clarity). B B then records 



how much signal from each uv location is mapped to the 
reconstructed power at u^. The movement of baselines 
in the uv plane with frequency generates significant fre- 
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quency dependence in B B because which baselines con- 
tribute to u^, and the locations of those baselines relative 
to u^, is strongly frequency dependent. This frequency 
dependence in the signal mapping generates frequency 
dependence in J even if the true sky brightness is spec- 
trally smooth. Since the power spectra are generated 
by Fourier transforming in frequency along each u^, the 
spectral ripple introduced by combining partially coher- 
ent baselines throws power into high k\\ modes. 

2.1. Multi-Baseline Simulation 

To illustrate this effect, we developed precision sim- 
ulations based on software holography/A projection 
(Morales & Matejek 2009; Bhatnagar et al. 2008) where 
the visibilities are gridded using the antenna response 
function (B is just a sampled version of B). We use this 
approach because it can be shown to be information loss- 
less (Tegmark 1997a,b), but the results are general to any 
analysis that combines measurements from non-identical 
baselines. Following Datta et al. (2010), we used the 
array layout from the originally proposed MWA instru- 
ment (Beardsley et al. 2012a) with 512 square antennas 
(Tingay et al. 2013). 

Figure 1 demonstrates the effect of multi-baseline 
mode-mixing using a simplified foreground with a single 
fiat-spectrum point source and focusing on one location 
in the uv plane. In the bottom panel the true phase of the 
source at the selected uv location is shown in black (ref- 
erenced to the lowest frequency). The red line shows the 
reconstructed phase at that location as a function of fre- 
quency, exhibiting obvious frequency structure absent in 
the true foreground signal. We have selected three repre- 
sentative frequencies, indicated by the dashed blue lines, 
picking frequencies with almost no phase error, large neg- 
ative and large positive error (left to right). Our selected 
uv point is in the lower left-hand quadrant of the uv plane 
and the upper nine panels zoom into that point, marked 
by a red cross, at each of the chosen frequencies. 

In the top row, the integration regions for all the base- 
lines that contribute to the uv point at any frequency 
are plotted in light grey. The baselines that contribute 
at the selected frequency are outlined in black, defining 
the region of the uv plane that contributes to the recon- 
structed signal at our point. The set of baselines that 
contribute change with frequency because the baseline 
length increases with frequency (baselines move to the 
lower left in subsequent columns) and the integration re- 
gions for the other columns are shown in middle grey. 

~ T 

The blue heat map shows normalized B B or the rela- 
tive influence of the areas within the integration region 
on the reconstructed signal at our uv point. Each vis- 
ibility is formed by integrating over an area in the uv 
plane with the antenna response function and then the 
visibilities are weighted by the response at the uv point 
and averaged to determine the best estimate of the sig- 
nal there. In other words, the reconstructed signal is a 
weighted average of the contributing visibilities. The in- 
fluence map is the effective weight of each location within 
the integration region on the reconstructed signal. 

The second row shows the true phase of our foreground 
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Fig. 1. — This figure demonstrates the effect of multi-baseline mode-mixing in the simplified case of one flat spectrum source at a single 
location in the uv plane. The bottom panel shows the true (flat black line) and reconstructed (red line) phase of the source (relative to 
the lowest frequency) at the uv location with the selected frequencies marked with blue dashed lines. The upper nine panels zoom into 
an area around the uv point at the three selected frequencies. The uv point is in the lower left-hand quadrant of the uv plane (uv = 
is far to the upper right) and is marked by a red cross in each panel. The top row shows the integration areas for all the baselines that 
contribute to the uv point at any frequency in light grey and the set of baselines that contribute at the selected frequencies are outlined in 
black, defining the region that contributes to the uv point. The integration areas are given by the response functions of the square MWA 
antennas. Baseline lengths increase with frequency, moving to the lower left in subsequent columns, and the integration regions for the 
other columns are shown in middle grey. The blue heat map shows the relative influence of areas within the integration region on the 
reconstructed signal. The second row shows the true phase of the foreground source with the integration region and the centers of the 
contributing baselines marked in black. In the third row the influence map from the top row is overlaid on the true phase from the second 
row. This figure is discussed in detail in section 2. 
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source with the integration region and the centers of the 
contributing baselines marked in black (each dot corre- 
sponds to a grey square in the top row). Finally in the 
third row we have overlaid the influence map from the top 
row on the phase from the second row. For the left-hand 
column the phase is almost perfect because there are two 
visibilities that are nearly on top of our uv point, domi- 
nating the influence map, and the contributions from the 
other three nearly cancel out. In the center column most 
of the visibilities are at lower phase and the closest base- 
lines, which dominate the influence map, are clustered 
just to the left and down from the uv point. In the third 
column the number of close visibilities above and below 
the uv point are nearly equal and the influence map is 
double-lobed but skewed slightly toward higher phase. 

The variation in the reconstructed signal with fre- 
quency is caused by the dithering of which baselines con- 
tribute and how strongly they contribute (i.e. the influ- 
ence map) as a function of frequency. The shape of the 
multi-baseline mode-mixing ripple is different in each uv 
pixel and depends on the details of the foregrounds and 
on the exact locations of the baselines contributing to 
each location, but unlike the mode-mixing discussed in 
previous work (Morales et al. 2012; Trott et al. 2012; 
Vedantham et al. 2012; Parsons et al. 2012), the spatial 
frequency (fcy) of this ripple is not limited by the field of 
view of the instrument so it can throw power into higher 
fey modes, including into the EoR window. 

2.2. Maybe we should.... 

While we have used gridding to show the origin of this 
effect graphically, multi-baseline mode- mixing is inherent 
in any approach that combines non-identical baselines 
to create power spectra. However the reasons for this 
are subtle and it is natural to suggest different analysis 
approaches to get around the problem. Common ideas 
include: 

Maybe we should use a different gridding kernel. The 
effect of using a different gridding kernel in this case is 
simply to change the weighting of the different visibilities 
in the reconstructed signal. However the influence map 
is a combination of the integral done by the instrument 
to form the visibility and the gridding kernel, so it won't 
change dramatically and there will still be shifts in the 
map that will lead to mode-mixing. In addition, using 
the antenna response function as the gridding kernel is 
optimal (Tegmark 1997b) and using any other gridding 
kernel would result in a loss of sensitivity to the EoR 
signal. 

Could we use different antenna elements to decrease 
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this effect? The frequency dependence of B B is domi- 
nated by the change in the baseline length as a function 
of frequency, not by the frequency dependence of the an- 
tenna response, so different antenna elements will not 
help much. The best way to mitigate this effect when 
designing an instrument is to have a smooth, dense base- 
line distribution, especially for small uv modes where the 
EoR signal is the strongest, because the accuracy of the 
reconstructed power is proportional to the density of the 
uv coverage. 

What if we changed the visibility weighting to make the 
influence map frequency independent? It is not possible 
to weight the visibilities to achieve a frequency indepen- 



dent influence map because most of the integration is 
done by the instrument to form the visibilities. For in- 
stance in the three maps shown in figure 1, the black out- 
lines show the areas of the uv plane that were integrated 
by instrument. With only 5-8 baselines that overlap the 
uv point at each frequency, there is clearly no way to 
weight them to create identical influence maps. 

What about calculating the signal at each uv point with- 
out gridding? This calculation would involve a weighted 
sum of the visibilities and suffers from the same mode- 
mixing as in the gridding approach because the locations 
of the contributing baselines vary with frequency. In- 
deed the gridding approach is mathematically identical 
to a weighted-sum of visibilities at each uv point. 

What if we only calculated the power spectrum at the 
baseline centers? To measure the power spectrum we 
need a measurement at each frequency for every uv point 
we are using. Since the baselines move in uv with fre- 
quency, there are no uv locations that have a baseline 
precisely centered on them at every frequency, leaving 
no visibilities to use in the analysis. To increase the data 
we might consider including baselines that are nearly 
centered on the uv location, effectively setting a radius 
around the uv point. This scenario puts us right back 
in the mode-mixing regime, however, because there will 
be some variation in the locations of the baselines we 
choose with frequency. The down selection in baselines 
used in this case would also result in a huge decrease in 
sensitivity. 

Can we design an array (or chose baselines) to avoid 
this kind of mode-mixing by having no partially coherent 
baselines? Even if there were no baselines that were 
partially coherent at any frequency, the baselines that 
contributed to the same location at different frequencies 
would have different offsets from the uv point, so the 
influence map would still vary with frequency. In fact 
our simulations show that the amount of contamination 
from mode-mixing is decreased in regions with densely 
spaced baselines and is much worse in areas with very 
few partially coherent baselines. 

Combining a sparse reduntant array and the delay 
spectrum, as pioneered by PAPER (Parsons et al. 2012), 
does avoid the multi-baseline mode-mixing effect because 
visibilities from non-identical baselines are never com- 
bined and the Fourier transform is along the baseline 
track through the uvf cube, not along the frequency 
axis. However, this approach also leads to a significantly 
smaller EoR window and, for the same collecting area, 
correspondingly lower EoR sensitivity. 

The key issue for multi-baseline analyses using the full 
EoR window is that the instrument does a weighted inte- 
gral in the uv plane that cannot be reversed and the only 
way to estimate the signal at any particular uv point is to 
use the measurements from nearby baselines. Combin- 
ing these measurements with a weighted average based 
on the antenna response function (either directly or by 
gridding) will provide the highest EoR sensitivity, but 
any approach will suffer from mode-mixing due to the 
dithering of the contributing baselines. 

3. POWER SPECTRA 

In figure 2 we show the extent of the mode-mixing con- 
tamination in k space with more realistic foregrounds. 
The foreground sources for this simulation are con- 
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structed using the differential source counts from the 6th 
Cambridge survey at 151 MHz (Hales et al. 1988) with 
a flux density range of of 0.1 - 1 Jy, resulting in approx- 
imately five thousand sources randomly located in a 30 
x 30 deg 2 field. The flux range was chosen to represent 
the dominant sources that would remain after traditional 
image-based deconvolution of the brightest sources. 

This power spectrum was constructed by Fourier trans- 
forming in the frequency direction and then calculat- 
ing a three dimensional power spectrum estimator using 
~ T 

weights derived from B B. A full description of this esti- 
mator will be described in an upcoming paper (Hazelton 
et al. in preparation), but briefly it accounts for the dif- 
ference in sensitivity of the two Fourier components (i.e. 
sine and cosine) for each mode and marginalizes over the 
covariance between different modes. The power spec- 
trum is reduced to the two dimensions shown here by 
doing a variance weighted average along annuli in the k x 
and k y directions. 

This power spectrum is similar to those shown in Datta 
et al. (2010), with a clear wedge shape and a less contam- 
inated EoR window (above the dashed line). The lowest 
fen mode shown is the flat-spectrum (fey = 0) mode, which 
would contain all the foreground power if there were no 
mode-mixing. Our simulations naturally include the sin- 
gle baseline effects identified by Morales et al. (2012), 
Trott et al. (2012), Vedantham et al. (2012) and Parsons 
et al. (2012) because we form visibilities as described 
in equation (1), but we find that the multi-baseline ef- 
fects dominate the mode-mixed power throughout the uv 
plane. The contamination that is present in the EoR win- 
dow is a signature of multi-baseline mode-mixing which 
can produce contamination to arbitrarily high fey . 

Fortunately there are two factors that mitigate the 
multi-baseline contamination at low fcj_, leaving the 
mode-mixed power in the EoR window lower than in the 
wedge. The first effect is that the shorter baselines that 
contribute at low k± move more slowly in uv space with 
frequency so there is less severe mode-mixing because 
the distribution of baselines contributing to a particular 
uv location varies more slowly. In addition, simulations 
show that areas with denser and smoother uv coverage 
are less badly affected by this type of mode-mixing, so 
a dense central core of baselines can significantly reduce 
the amount of contamination at low k±. A dense cen- 
tral core is also desirable for increasing the sensitivity 
of arrays to the EoR signal, so it is a common feature 
among most of the arrays currently being built to detect 
the EoR. 

4. CONCLUSION 

We have identified a new type of mode-mixing that 
occurs when measurements from non-identical baselines 
are combined coherently to increase instrumental sensi- 
tivity to the EoR power spectrum. Multi-baseline mode- 
mixing is present in all analyses that combine measure- 
ments from non-identical baselines or use images to make 
power spectra, and although the majority of the con- 
tamination is in the wedge it also contaminates the EoR 
window. 

The amplitude of the contamination is significantly 
lower for regions with smooth, dense uv coverage. It 
is the dominant mode-mixing term in our simulations 
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Fig. 2. — Power spectrum showing the mode-mixing contamina- 
tion in a simulation with approximately five thousand point sources 
between 0.1 - 1 Jy (see section 3 for details). The EoR window is 
above and to the left of the black dashed line. 



using the originally proposed MWA array with 512 an- 
tennas, a layout with remarkably smooth uv coverage in 
its core (Beardsley et al. 2012a). While the amplitude 
of the multi-baseline mode-mixing effect depends on the 
specifics of each instrument and can only be determined 
through precision simulations, the dominance for the 512 
antenna MWA suggests that it will likely be the primary 
mode-mixing effect for the first generation of EoR instru- 
ments that use multi-baseline analyses. 

Now that we understand the sources of the mode- 
mixing foreground identified by Datta et al. (2010), the 
natural question becomes how can we mitigate them. In 
the next installment of our informal series on the mode- 
mixing foreground we will explore how to mitigate the 
multi-baseline mode-mixing foreground. Conceptually, 
the frequency structure of mode-mixing contamination 
has a characteristic shape at each point given by the 
arrangement of the baselines, the antenna response, and 
the local uv foreground. Unfortunately we cannot just 
fit for the mode-mixing because the frequency shape de- 
pends non-linear ly on the foregrounds, but we can solve 
the problem iteratively given a good model of the instru- 
ment. Because we will only subtract shapes typical of the 
instrumental response, we will preferentially remove the 
mode-mixing terms while leaving as much of the EoR sig- 
nal as possible. In spirit this approach approximates the 
covariance inversion of Liu & Tegmark (2011) and Dillon 
et al. (2012), while being computationally straightfor- 
ward. 

It is not yet clear whether experiments based on single- 
baseline power spectrum analyses (e.g. PAPER) with 
lower levels of mode-mixing and a smaller EoR window 
or those opting for the higher sensitivity multi-baseline 
approach with increased mode-mixing (MWA, LOFAR) 
will be better able to detect the EoR power spectrum. 
The outcomes of the currently planned experiments will 
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inform the next generation of Hydrogen Epoch of Reion- 
ization Arrays (HERA). 
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